Comparing Separation Quality of Nonnegative Matrix Factorization and Nonnegative Matrix Factor 2D Deconvolution in Audio Source Separation Tasks

نویسندگان

  • Julian M. Becker
  • Volker Gnann
چکیده

The Nonnegative Matrix Factorization (NMF) is widely used in audio source separation tasks. However, the separation quality of NMF varies a lot depending on the mixture. In this paper, we analyze the use of NMF in source separation tasks and show how separation results can be significantly improved by using the Nonnegative Matrix Factor 2D Deconvolution (NMF2D). NMF2D was originally proposed as an extension to the NMF to circumvent the problem of grouping notes, but it is used differently in this paper to improve the separation quality, without taking the problem of grouping notes into account.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Nonnegative Tensor Factorization for Directional Blind Audio Source Separation

We augment the nonnegative matrix factorization method for audio source separation with cues about directionality of sound propagation. This improves separation quality greatly and removes the need for training data, but doubles the computation.

متن کامل

Nonnegative Tensor Factorization with Frequency Modulation Cues for Blind Audio Source Separation

We present Vibrato Nonnegative Tensor Factorization, an algorithm for single-channel unsupervised audio source separation with an application to separating instrumental or vocal sources with nonstationary pitch from music recordings. Our approach extends Nonnegative Matrix Factorization for audio modeling by including local estimates of frequency modulation as cues in the separation. This permi...

متن کامل

Block Nonnegative Matrix Factorization for Single Channel Source Separation

Nonnegative Matrix Factorization (NMF) [1, 2] has been widely used in audio research, e.g. automatic music transcription [3], musical source separation [4], and speech enhancement [5]. The key strategy for applying NMF to audio-related tasks is to find a lower rank representation of the Short Time Fourier Transformed (STFT) input signal and use the basis vectors as dictionaries. For example, in...

متن کامل

A Modified Digital Image Watermarking Scheme Based on Nonnegative Matrix Factorization

This paper presents a modified digital image watermarking method based on nonnegative matrix factorization. Firstly, host image is factorized to the product of three nonnegative matrices. Then, the centric matrix is transferred to discrete cosine transform domain. Watermark is embedded in low frequency band of this matrix and next, the reverse of the transform is computed. Finally, watermarked ...

متن کامل

A Modified Digital Image Watermarking Scheme Based on Nonnegative Matrix Factorization

This paper presents a modified digital image watermarking method based on nonnegative matrix factorization. Firstly, host image is factorized to the product of three nonnegative matrices. Then, the centric matrix is transferred to discrete cosine transform domain. Watermark is embedded in low frequency band of this matrix and next, the reverse of the transform is computed. Finally, watermarked ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012